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Abstract 

A parameterized problem consists of a classical problem and an additional component, 
the so-called parameter. This point of view allows a formal definition of preprocessing: 
' Given a parameterized instance (/, fc), a polynomial kernelization computes an equivalent 

. instance (/', k') of size and parameter bounded by a polynomial in k. We give a complete 

' classification of Min Ones Constraint Satisfaction problems, i.e., Min Ones SAT(r), with re- 

spect to admitting or not admitting a polynomial kernelization (unless J\fV C co-A^P/poly). 
For this we introduce the notion of mergeability. If all relations of the constraint language F 
are mergeable, then a new variant of sunflower kernelization applies, based on non-zero- 
ff^ ' closed cores. We obtain a kernel with 0(fc''+^) variables and polynomial total size, where d 

. is the maximum arity of a constraint in F, comparing nicely with the bound of 0{k'^~^) 

vertices for the less general and arguably simpler d-HiTTiNG Set problem. Otherwise, any 
, relation in F that is not mergeable permits us to construct a log-cost selection formula, 

r \ ' i.e., an n-ary selection formula with O(logn) true local variables. From this we can con- 

struct our lower bound using recent results by Bodlaender et al. as well as Fortnow and 
Santhanam, proving that there is no polynomial kernelization, unless TV'P C co-A/'P/poly 
and the polynomial hierarchy collapses to the third level. 
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^ ■ 1 Introduction 

lO ■ Preprocessing and data reduction are ubiquitous, especially in the context of combinatorially 
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hard problems. Of course, it is a commonplace that there can be no polynomial-time algo- 



I rithm that provably shrinks every instance of an A/'P-hard problem, unless V = AfV. Still, 

• there does in fact exist a formal notion of efficient preprocessing, coming from the field of pa- 

, rameterized complexity. There, problems are considered with an additional component, the 

so-called parameter, intended to express the difficulty of a problem instance, e.g., solution size, 
nesting depth, or treewidth. This way preprocessing can be defined as a polynomial-time map- 
ping K : (/, /c) I— > {I',k') such that {I,k) and {I',k') are equivalent and k' as well as the size 
5^ I of /' are bounded by a polynomial in the parameter k; K is called a polynomial kernelization. 

Parameterized complexity originated as a multivariate analysis of algorithms, motivated by the 
huge difference in (often trivial) n^^^'f versus f{k)n^ algorithms, the latter having a much better 
scalability. Kernelization is one possible technique to prove fixed-parameter tractability (i.e., 
the existence of an f{k)n'^ algorithm). Indeed it is known that a problem is fixed-parameter 
tractable if and only if it admits a kernelization (see [lO]). However, this relation does not 
imply kernelizations with a polynomial size bound; achieving the strongest size bounds or at 
least breaking the polynomial barrier is of high interest. Consider for example the list of im- 
provements for Feedback Vertex Set, from the first polynomial kernel [7], to cubic [1], and 
now quadratic |20j ; the existence of a linear kernel is still an open problem. Recently a seminal 
paper by Bodlaender, Downey, Fellows, and Hermelin [5] provided the first polynomial lower 
bounds on the kernelizability of some problems, based on hypotheses in classical complexity. 
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Using results by Fortnow and Santhanam [13], they showed that so-cahed compositional param- 
eterized problems admit no polynomial kernelizations unless MV C co-AA'P/poly; by Yap |21j . 
this would imply that the polynomial hierarchy collapses. The existence of such lower bounds 
has sparked high activity in the field (see related work below). 

Constraint satisfaction problems (CSP) are a fundamental and general problem setting, 
encompassing a wide range of natural problems, e.g., satisfiability, graph modification, and 
covering problems. CSPs are posed as restrictions, called constraints, on the feasible assignments 
to a set of variables. The constraints are applications of relations from a given constraint 
language T to tuples of variables. The complexity of deciding feasibility of a CSP or finding 
an assignment that optimizes a certain goal varies according to the constraint language. E.g., 
consider Clique as a Max Ones SAT({-i2; V ^y}) problem, which is hard to approximate 
and also W[l]-complete when parameterized by the size of the clique. Khanna et al. [13] 
classified Boolean CSPs according to their approximability, for the questions of optimizing 
either the weight of a solution (Min/Max Ones SAT problems) or the number of satisfied or 
unsatisfied constraints (Min/Max SAT problems). We study the kernelization properties of Min 
Ones SAT(r), parameterized by the number of true variables, and classify these problems into 
admitting or not admitting a polynomial kernelization. We point out that Max SAT(r), as a 
subset of Max SNP (cf. [14]), admits polynomial kernelizations independent of T |15j . 

Related work In the literature there exists an impressive list of problems that admit polyno- 
mial kernels (in fact often linear or quadratic); giving stronger and stronger kernels has become 
its own field of interest. We name only a few results for problems that also have a notion of 
arity: 0{k'^~^) universe size for Hitting Set with set size at most d [l], 0{k'^~^) vertices for 
packing k vertex disjoint copies of a d-vertex graph [18], and 0{k'^) respectively 0{k'^^^) base 
set size for any problem from MIN F+IIi or MAX NP with at most d variables per clause [15]. 

Let us also mention a few lower bound results that are based on the framework of Bodlaen- 
der et al. [H]. First of all, Bodlaender et al. [5] provided kernelization-preserving reductions, 
which can be used to extend the applicability of the lower bounds. Using this, Dom et al. [9] 
gave polynomial lower bounds for a number of problems, among them Steiner Tree and 
Connected Vertex Cover. Furthermore they considered problems that have a k'l'^'^'> kernel, 
where k is the solution size and d is a secondary parameter (e.g., maximum set size), and showed 
that there is no kernel with size polynomial in k + d; for, e.g.. Hitting Set, Set Cover, and 
Unique Coverage. Fernau et al. p!2] showed that Leaf Out Branching does not admit a 
polynomial kernelization, while Rooted Leaf Out Branching does. They express that this 
gives a Turing kernelization for Leaf Out Branching, by creating one kernel for each choice 
of the root. In [TB] the present authors show that a certain Min Ones CSP problem does not 
admit a polynomial kernel and employ this bound to show that there are TC-fiee edge deletion 
respectively edge editing problems that do not admit a polynomial kernel. 

Our work We give a complete classification of Min Ones SAT(r) problems with respect to 
admittance of polynomial kernelizations. Apart from the hardness dichotomy due to Khanna et 
al. |14j . we distinguish constraint languages F by being mergeahle or containing at least one re- 
lation that is not mergeable. For the first case, we provide a new polynomial kernelization based 
on non- zero -closed cores. For the latter we show that Min Ones SAT(r) is either polynomial- 
time solvable or does not admit a polynomial kernelization, unless MV C co-TVP/poly. 

Structure of the paper We introduce some basic notation and the notion of mergeability 
in Sections [2] and [3j Sections S] and [5] then form the main part of this work, i.e., the general 
polynomial kernelization for Min Ones SAT(r) when all relations of F are mergeable, and the 
lower bound for constraint languages that contain at least one relation that is not mergeable. 
We conclude in Section EJ with a discussion of implications as well as open problems. 
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2 Boolean Constraint Satisfaction Problems 



A constraint is an application of a relation R to a tuple of variables (xi, . . . ,Xr), requiring 
that R{xi, . . . , Xr) holds, allowing repeated variables (e.g., R{x,x,y)). A constraint language 
is a set F of relations; we shall require throughout that every constraint language T is finite, 
and contains only relations over the boolean domain. A formula J- over F is a conjunction of 
constraints using relations R € T, and V{J^) denotes the set of variables that occur in J^. An 
assignment to the variables of J- satisfies J- if every constraint in holds under the assignment. 
The weight of an assignment is the number of variables that it sets to true. Fixing a finite set F 
with relations over the boolean domain defines a Min Ones SAT(F) problem: 

Input: A formula over a finite constraint language F; an integer k. 
Parameter: k. 

Task: Decide whether there is a satisfying assignment for of weight at most k. 

As an example, if R{x,y) = {(0, 1), (1, 0), (1, 1)}, then Min Ones SAT(ii) is the well-known 
problem Vertex Cover. The approximation properties of such problems have been classified 
by Khanna et al. [T3]; in particular, we have the following. 

Theorem 1 ([14J). Let F be a finite set of relations over the boolean domain. IfT is zero-valid, 
Horn, or width-2 affine (i.e., implementable by assignments, {x = y), and {x / y)), then Min 
Ones SAT(r) is in V; otherwise it is MV -complete. 

SAT(F) denotes the problem of deciding whether any satisfying assignment exists; the clas- 
sical complexity of these problems was classified by Schaefer [T9j, and the parameterized com- 
plexity, for the question of finding a satisfying assignment with exactly k true variables, has 
been classified by Marx [17j. The problem Min Ones SAT(F) is fixed-parameter tractable for 
every finite F, by a simple branching algorithm; see |17j . 

We need to define a number of types of constraints. Let F be a finite set of relations 
over the boolean domain. We say that F implements a relation i? if i? is the set of satisfying 
assignments for a formula over F, i.e., R{xi, . . . , Xr) = Aj ^ii^Hj • • • > ^it) where each Ri £T (we 
do not automatically allow the equality relation unless =€ F). A positive clause is a disjunction 
of (non-negated) variables. A negative clause is a disjunction of negated variables. We say 
that a constraint is zero-valid if a tuple of zeros satisfies it. A constraint is Horn if it can be 
implemented by disjunctions containing at most one unnegated variable each, dual Horn if it 
can be implemented by disjunctions containing at most one negated variable each, and IHSB- 
(Implicative Hitting Set Bounded-) if it can be implemented by assignments, implications, and 
negative clauses. These constraint types can also be characterized by closure properties. For 
two tuples a = (ai, . . . , a^.), 13 = (/3i, . . . , Pr)., let a A /3 = (ai A . . . , A /3r), and likewise 
for aV and write a < (3 if ai < (3i for every 1 < i < r (where and 1 are used for false and 
true values, respectively). We then have that a constraint R is Horn if and only if it is closed 
under intersection, i.e., if a,/3 G R, then a A P & R, and a constraint is dual Horn if and only 
if it is closed under disjunction. Likewise, a constraint R is IHSB- if and only if it is closed 
under an operation a A (/3 V 7) for tuples a, /3, 7 in R. See [8j for more on this. A constraint 
language F is zero-valid (one-valid, Horn, dual Horn, IHSB-) if every i? G F is. 

3 Mergeability 

The characterization of the dichotomy of the kernelizability of Min Ones SAT(F) given in this 
paper centers around a newly introduced property we refer to as mergeability. Specifically, 
we will see that for any finite set F of relations over the boolean domain, Min Ones SAT(F) 
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admits a polynomial kernelization if either Min Ones SAT(r) is in V or every relation i? E F 
is mergeable; in every other case, Min Ones SAT(r) admits no polynomial kernelization unless 
MV C co-AAP/poly (which would imply that the polynomial hierarchy collapses to the third 
level). In this section, we define this property and give some basic results about it. 

Definition 1. Let i? be a relation on the boolean domain. Given four (not necessarily distinct) 
tuples a, /3, 7, (5 E i?, we say that the merge operation applies \i a/\5 < (5 < a and (3A^<6<j. 
If so, then applying the merge operation produces the tuple aA(/3V7). We say that R is mergeable 
if for any four tuples a, fi,^,6 & R for which the merge operation applies, we have aA(/3V7) G R. 

We show some basic results about mergeability. First, we show an alternate presentation of 
the property; this perspective will be important in Section HI when sunflowers are introduced. 

Proposition 1. Let R be a relation of arity r on the boolean domain. Partition the positions 
of R into two sets, called the core and the petals; w.l.o.g. assume that positions 1 through c are 
the core, and the rest the petals. Let {ac-, ocp), where ac is a c-ary tuple and ap an (r — c)-ary 
tuple, denote the tuple whose first c positions are given by ac, and whose subsequent positions 
are given by ap. Consider then the following four tuples. 

a = {ac,ap) 

P = (ac,0) 

7 = ilclp) 
5 = (7c,0) 

If a through 6 are in R, then the merge operation applies, giving us 

{ac,ap A7p) G R. 

Furthermore, for any four tuples to which the merge operation applies, there is a partitioning 
of the positions into core and petals such that the tuples can be written in the above form. 

It is straight-forward that this property is preserved by implementations. 

Proposition 2. Mergeability is preserved by assignment and identification of variables, i.e., 
if R is mergeable, then so is any relation produced from R by these operations. Further, any 
relation implementable by mergeable relations is mergeable. 

Next, we show what mergeability implies for a zero- valid relation. 

Lemma 1. Any zero-valid relation R which is mergeable is also IHSB-, and can therefore be 
implemented using negative clauses and implications. 

Proof. We show that aA(/3V7) € R for all tuples a, (3,^ € R. First, for any two tuples a, f3 € R, 
we can apply the merge operation to the tuples a, 0, P, 0, to show that q A /? G i?. It can then 
be checked that the operation applies to the tuples a, {a A (3), (a A 7), and (a A /? A 7), and 
that this implies a A (/3 V 7) G i?. As previously mentioned, this shows that R is IHSB-. Note 
that assignments add no expressive power when R is zero- valid. □ 

Note that by Proposition^ this shows that the only zero- valid relations that can be produced 
from a mergeable relation by assigning or identifying variables are IHSB-. However, this still 
leaves room for other positive examples; e.g., the constraints {x + y + z = 1 (mod 2)) and ((x = 
y) z) are both mergeable. 
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4 Kernelization 



In this section, we show that Min Ones SAT(r) admits a polynomial kernelization if all relations 
in r are mergeable. For the purpose of describing om' kernelization we first define a sunflower 
of tuples, similarly to the original sunflower definition for sets. We point out that a similar 
though more restricted definition for sunflowers of tuples was given by Marx [17J; accordingly 
the bounds of our sunflower lemma are considerably smaller. 

Definition 2. Let U he a finite set, let d G N, and let Ti C W^. A sunflower {of tuples) with 
cardinality t and core C C {1, . . . , d} in W is a subset consisting of t tuples that have the same 
element at all positions in C and, in the remaining positions, no element occurs in more than 
one tuple. The set of remaining positions P = {1, . . . , d} \ C is called the petals. 

As an example, (xi, . . . , Xc, yn, . . . , yip), . . . , (xi, . . . ,Xc,yti, ■ ■ ■ , Vtp) is a sunflower of car- 
dinality t with core C = {!,..., c}, if all yij and are distinct when i ^ i' . Note that, 
differing from Marx [T7] variables in the petal positions may also occur in the core. For sets 
of tuples 7i C W^, we give a variant of Erdos' and Rado's Sunflower Lemma [llj . The proof 
is along the same lines as the original, only requiring an additional factor of d\ for picking the 
shared core positions. Same as the Sunflower Lemma, this immediately gives a polynomial-time 
algorithm for finding a sunflower of tuples. 

Lemma 2. Let U be a finite set, let d G N, and let Ti. C W^. If the size of Ti is greater 
than k'^{dl)'^ , then it contains a sunflower of cardinality k + 1. 

Proof. If d = 1, then a sunflower of size k + 1 can be easily found, since any k + 1 tuples of 
arity d = 1 form a sunflower with empty core. Now for induction, assume the lemma to be true 
for all d' <d-l. 

Let X contain {xi, . . . , Xd} for each tuple {xi, . . . ,Xd) S Ti.. Select a maximal pairwise 
disjoint subset F X. If [F[ > A;-|- 1 then its elements correspond to A; + 1 tuples that share no 
variable, i.e., a sunflower with empty core. Otherwise, if \F\ < k, then all other sets of X have 
a non-empty intersection with some element of F. Since the sets correspond to the tuples of Ti 
there must be an element of some set in F, say x, that occurs in at least \?{\/kd tuples of Ti, as 
there are at most kd such elements. Therefore, there must be a position, p € {1, . . . ,d}, such 
that X occurs in position p of at least \'H\/kd^ > k'^~^{{d — 1)!)^ tuples of TC. 

Define H' by H' = {{xi,. . . ,Xp_i,Xp+i, ...,Xd) \ (xi, . . . , Xp_i, x, Xp+i, . . . ,Xrf) G 7i}. Ob- 
serve that Ti' C W^'^ and \Ti'\ > k'^~^{{d — 1)!)^, implying that a sunflower of cardinality k + 1 
in Tl' can be found in Ti'; immediately giving a sunflower in Ti. □ 

Our kernelization requires also the notion of a zero-closed position and the related zero- 
closure of a relation. The following definition introduces these concepts. 

Definition 3. Let R be an r-ary relation. The relation R is zero-closed on position i, if for 
every tuple (ti, . . . , ti-i,ti, ti+i, . . . ,tr) £ Rwe have (ti, . . . , 0, tj+i, . . . ,tr) & R. A relation 
is non-zero-closed if it has no zero-closed positions. We define functions A, 11, and V: 

• Ap{R) is defined to be the zero-closure of R on positions P C {1, . . . , r}, i.e., the smallest 
superset of R that is zero-closed on all positions i & P. 

• n(i?) denotes the non-zero-closed core, i.e., the projection of R onto all positions that are 
not zero-closed or, equivalently, the relation on the non-zero-closed positions obtained by 
forcing Xj = for all zero-closed positions. 
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• Vc{R) denotes the sunflower restriction of R with core C: w.l.o.g. the relation expressed 
by [Vc(-R)](xi, . . . ,Xr) = R{xi, . . . ,Xr) A Rixi, . . . ,Xc,0, . . . ,0) for C = {1, . . . ,c}. The 
corresponding core relation is the |C|-ary relation given by R{xi, . . . , Xc, 0, . . . , 0). 

The mappings Ap and Vc extend also to constraints: Vc{Rixi, ■ ■ ■ , Xr)) = [Vc(i?)](xi, . . . , Xr)- 
Similarly for IT, but variables in zero-closed positions are removed, e.g., when i is the only zero- 
closed position of R then n(i?(xi, . . . , Xr)) = [Il{R)]{xi, . . . , Xj+i, . . . , Xr). 

Lemma 3. Let R be a mergeable relation and let C U P be a partition of its positions into core 
and petals. There is an implementation ofVciR) using Ap(Vc(i?)) and implications. 

Proof. By Prop. O Vc(i?) must be mergeable. Further, assigning any set of values to the 
variables in the core produces a zero-valid relation on the petals, which is still mergeable. Thus 
by Lemma [H this relation on the petals has an implementation using negative clauses and 
implications. For any tuple a in Vc(i?) or Ap(Vc(i?)), let its core assignment be the values it 
assigns to the positions in C. By definition, we have Vc(-R) C Ap(Vc(i2)). 

Consider now a tuple a G Ap(Vc(^)) \ Vc(-R). Assume that a makes core assignment ac; 
thus there is a matching a € Vc(-R), a > cr, with an identical core assignment. As in Prop. [H 
write a = {ac,ctp), and consider the constraint on the petals that is formed by core as- 
signment ac- We see that this constraint must entail some implication (yj yj), where a 
assigns Ui = l,yj = while ap assigns yi = yj = 1. Further, since Vc(i?) is a sunflower 
restriction, we have (3 = {ac,0) G Vc{R)- Now if (t/j — > yj) does not hold in Vc(-R) in 
general, then let 7 = (jc^lp) ^ ^c{R) be a tuple which assigns yi = 1, yj = 0, and 
let 5 = (7C;0) G Vc(-R). By Prop. [H we can now apply the merge operation to tuples a 
through 6, showing {ac, ap A 7p) G Vc(i?). But this is a tuple with core assignment ac which 
assigns y^ = 1, yj = 0, which is a contradiction. Thus {yi — > yj) holds in Vc{R) regardless of 
core assignment, and the constraint {yi yj) can be added to our implementation of Vc(-R), 
removing the tuple a. 

Adding all implications between petals which hold in Vc(i?) removes from our implementa- 
tion all tuples which are in Ap(Vc(-R)) but not in Vc{R), so that the conjunction of Ap(Vc(-R)) 
with all valid implications is an implementation of Vc(i?). □ 

The following technical lemma proves that the relation Ap(Vc(-R)), required by Lemma El 
is mergeable. 

Lemma 4. Let R be a mergeable relation and let C L) P be a partition of its positions into core 
and petals. Then Ap(Vc(i?)) is mergeable. 

Proof. Recall that Ap{\7c{R)) is the zero-closure on the petal positions of the sunflower re- 
striction of R with core C. Let R' := Ap{Vc{R))', assume by way of contradiction that R' is 
not mergeable. Then there are four tuples in R' such that applying the merge operation on the 
tuples creates a tuple not in R' . Let C U P' be the partition of the positions of R' into core 
and petals that is used in this counterexample. Grouping the positions of R' in four groups, 
written in the order {C riC,C' D P,P' D C,P' D P), naming the groups W through Z, the 
counterexample can be written as follows. 



{Wi,Xi,Yi,Zi) G R' 

{Wi,Xi,0,0) G R' 

{W2,X2,Y2,Z2) G R' 

(1^2,^2,0,0) G R' 



(1) 
(2) 
(3) 
(4) 
(5) 



{Wi,Xi,YiAY2,ZiAZ2) i R! 



6 



We will derive a contradiction. First, we note that for each equation there is a corre- 
sponding tuple in Vc(-R)- 

{Wi,Xia,Yi,Zia) G Vc(i?) (6) 

(Wi,Xn,,^,Zh) G Vc{R) (7) 

(VF2,X2a,l2,^2a) G Vc(i?) (8) 

(TF2,X2b,0,Z,) G Vc{R) (9) 

Here, and X^, are supersets of Xi, and likewise for X2, .Z^i, and Z2. Z^^ and are arbitrary. 
Using that Vc{R) is a sunflower restriction and mergeable, we can conclude the following. 

(1^1,0,0,0) e Vc{R) (10) 

(1^2,0,0,0) G Vc(i?) (11) 

(H^l,Xi„AX2a,Fi Ay2,^la AZ2a) G Vc(i?) (12) 

The first two come from ([7]) and ([9]); the third is produced by a merge operation on ([6]) and ([8]) 
using these two. Now, the tuples which match Wi on the VF-variables form a zero- valid relation. 
By Lemma [H this relation is closed under an operation (a A (/3 V 7)). Applying this on the 
tuples of equations dS]), ([7]), and (fT2]) gives us the following conclusion. 

{Wi,Xia A {Xib V X2a), ll A Y2, Zia A (^6 V Z2a)) G Vc(i?) (13) 

In particular, this tuple matches ([5]) on the W- and y-variables, and is a superset of it on the 
X- and Z-variables. Since R' is the zero-closure of Vc{R) on the X- and Z-variables, we have 
a contradiction. □ 

Lemmas [3] and m are the foundation for a sunflower-based kernelization for Min Ones SAT(r). 
They show that the sunflower restriction Vc(i?) of some mergeable /^-constraint can be imple- 
mented using its mergeable zero-closure on the petal positions as well as implications. How- 
ever, Ap(Vc(-R)) is not necessarily contained in F and such a replacement does not give any 
immediate reduction of the instance. Indeed, the arity of Ap(Vc(i?)) is the same as that of R. 

We address this problem by introducing a new measure of difficulty for formulas, namely 
the sum of non-zero-closed cores, based on the following definition. 

Definition 4. Let J- he a. formula and let i? be a relation. We define Z{T, R) as the set of all 
tuples (xi, . . . , xt) where [n(i?)](xi, . . . , xt) is the non-zero-closed core of an i?-constraint in J^. 

For some relations, sunflower restriction, zero-closure, and the relation itself are the same, for 
certain selections of core and petal positions. See for example the following mergeable relation: 

ii ={(0,0, 1,0), (0,1, 0,0), (0,1, 0,1), (1,0, 0,0), (1,0, 0,1), (1,1, 1,0), (1,1, 1,1)} 

=V{1,2,3}(«) = A{4}(V{i,2,3}(^)) 

This is also one of the smallest examples, where a sunflower restriction cannot be expressed using 
the core relation (i.e., all tuples for the core such that the petal variables can take value 0), 
implications, and negative clauses. Here, the core relation is {(0, 0, 1), (0, 1, 0), (1, 0, 0), (1, 1, 1)}, 
but no implication or negative clause can exclude the tuple (0,0,1,1) without also excluding 
other tuples that do occur in the sunflower restriction. Thus there are mergeable relations for 
which a sunflower-based reduction using Lemma [3] does not lead to any simplification, even in 
terms of Z{T,R). We overcome this difficulty by searching for sunflowers among the tuples 
of Z{T, R). Those are leveraged into a replacement of the ii-constraints that contributed these 
tuples. The following theorem shows this approach in detail. 
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Theorem 2. Let T be a mergeahle constraint language with maximum arity d. Let T he a 
formula over V and let k be an integer. In polynomial time one can compute a formula T' over 
a mergeahle constraint language F' ^ F with maximum arity d, such that every assignment of 
weight at most k satisfies T if and only if it satisfies T' and, furthermore, \Z{T' , R)\ € 0{k'^) 
for every non- zero-valid relation that occurs in T' . 

Proof. We begin constructing T', starting from J^' = T . While \Z{T\ > k'^{dl)'^ for any non- 
zero-valid relation R in J-', search for a sunflower of cardinality A; + 1 in Z{J^',R), according 
to Lemma El Let C denote the core of the sunflower and apply the following replacement. 
Remove each i2-constraint whose non-zero-closed core matches a tuple of the sunflower, and 
add its sunflower restriction with core C using an implementation according to Lemma El 
Repeating this step until \Z{J^',R)\ < k'^{d\f for all non-zero-valid relations R in T' completes 
the construction. 

Now, to prove correctness, let us consider a single replacement. We denote the tuples of 
the sunflower by (xi, . . . ,Xc,yii, . . . ,yip), with i G {1, . . . , k + 1}, i.e., w.l.o.g. with core C = 
{1, . . . , c} and petals P = {c + 1, . . . , c + p}. Let cp be any satisfying assignment of weight 
at most k and consider any tuple {xi, . . . ,Xc,yii, . . . ,yip) of the sunflower. There must be 
a constraint R{xi, . . . , Xc,yii, ■ ■ ■ , yip, zi, . . . , zt) whose non-zero-closed core matches the tuple, 
w.l.o.g. we take the last positions of R to be zero-closed, let Z be those positions. Thus (j) 
must satisfy R{xi, . . . , Xc,yii, . . . , yip, 0, . . . , 0), since the Zi are in zero-closed positions. Ob- 
serve that, by maximum weight k, the assignment (j) assigns to all variables yii,...,yip 
for an i € {1, . . . ,k + 1}. Thus </> satisfies also R{xi, . . . , Xc, 0, . . . , 0). Hence for any con- 
straint R{xi, . . . ,Xc,yii, . . . ,yip, zi, . . . , Zt), it satisfies "^ciRixi, . . . ,Xc,yii, . . . ,yip, zi, . . . , zt)) 
too. This permits us to replace each /^-constraint, whose non-zero-closed core matches a tuple 
of the sunflower, by an implementation of its sunflower restriction with core C, according to 
LemmaO The implementation uses Apuz(Vc(-R(xi, . . . , Xc,yii, ■ ■ ■ , yip, z\,..., zt))) and impli- 
cations. By Lemma jH the added constraints Apuz(Vc(-R(a;i, . . . , Xc, ■, ■ ■ ■ , .))) are mergeable, 
implying that all constraints in !F' are mergeable. 

To establish that the construction can be performed efficiently, i.e., in time polynomial in 
the size of J-, we use as a measure of J-"' the sum of \Z{T',R)\ over all relations R occurring 
in J^'. First, let us observe that, initially, this measure is bounded by the size of J- since 
each i?-constraint of J^' contributes at most one tuple to the corresponding set Z{T' , R) (recall 
that we start with JF' = JF). Consider again the replacement made in each step: All i?- 
constraints matching one of the tuples of the sunflower are replaced by an implementation 
using R = Apuz{S/c{R)) and implications. It is crucial to observe that all added constraints 
contribute the same tuple to Z{T',R), consisting only of variables with positions in C. This 
is caused by the application of the zero closure A on all positions but those in C . Hence 
the A; + 1 tuples of the sunflower are removed, as all matching ii-constraints are replaced, and 
only one new tuple is added to the set Z{T' , R). This decreases the measure, implying that the 
modification step is applied at most a number of times polynomial in the size of JF. 

Finally let us express the fact that each iteration of the replacement can be done effi- 
ciently. The set Z{T',R) can be generated in one pass over the formula and since the arity is 
bounded by d there is only a constant number of relations. The applications of Lemma [2] to 
find a sunflower among the tuples of the sets Z{T' ,R) take time polynomial in \J-\, since the 
size \Z{T' , R)\ £ 0([JF|). Observe that the size of J-' is bounded by a polynomial in \J^\ at all 
times, since there is only a polynomial number of possible constraints of arity at most d on the 
variables of J-". □ 

Now we are able to derive a polynomial kernelization for Min Ones SAT(F). For a given 
instance {J-, k), it first generates an equivalent formula JF' according to Theorem[2l However, J^' 
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will not replace T ^ rather, it allows us to remove variables from !F based on conclusions drawn 
from T' . This approach avoids the obstacle of a possible lack of expressibility from using only 
the language F, and requires no additional assumptions or annotations to be made. 

Theorem 3. Let T be a mergeahle constraint language. Then Min Ones SATfT) admits a 
polynomial kernelization. 

Proof. Let (JF, k) be an instance of Min Ones SAT(r) and let d be the maximum arity of 
relations in T. According to Theorem [21 we generate a formula J^', such that assignments of 
weight at most k are satisfying for if and only if they are satisfying for J^'. Moreover, for each 
non-zero-valid relation R, we have that \Z{J^',R)\ G 0{k'^). Note that constraints of J-' have 
maximum arity d. We allow the constant to be used for replacing variables; a construction 
for this not using (x = 0) follows at the end of the proof. 

First, according to Lemma [H we replace each zero- valid constraint of J-' by an implemen- 
tation through negative clauses and implications. Next, we address variables that occur only 
in zero-closed positions constraints in . By definition of zero-closed positions it is immediate 
that setting such a variable to 0, does not affect the possible assignments for the other variables. 
By equivalence of J- and J-' with respect to assignments of weight at most /c, the same is true 
for J-. We replace all such variables by the constant in ^ and J-\ maintaining the equivalence 
with respect to assignments of weight at most k. 

Now, let X be the set of variables that occur in a non-zero-closed position of some non-zero- 
valid constraint of J-' . For each variable x ^ X count the number of variables that are implied 
by X, i.e., that have to take value 1 if x = 1, by implication constraints in T' . If the number of 
those variables is at least k, then there is no satisfying assignment of weight at most k for J^' 
that assigns 1 to x. By equivalence of J- and JF' with respect to such assignments, we replace all 
occurrences of such a variable x by the constant 0, again maintaining the equivalence property. 
Finally we replace all variables y € V{J-') \X, that are not implied by a variable from X in T' , 
by the constant in .F and !F' . Note that such variables y occur only in zero-closed positions 
and in implications. It can be easily verified that this does not affect satisfiability with respect 
to assignments of weight at most k. For efficiency of this modification consider the fact that 
the number of implications in T' is polynomial in the initial size of since there are at most 
two implications per pair of variables of J-. This completes the kernelization. 

Now we prove a bound of 0{k'^^^) on the number of variables in T. First, we observe that 
all remaining variables of J- must occur in a non-zero-closed position of some constraint of J-' . 
We begin by bounding the number of variables that occur in a non-zero-closed position of some 
non-zero-valid i?-constraint, i.e., the remaining variables of the set X. Observe that such a 
variable must occur in the corresponding tuple of Z{T\R). Since there is only a constant 
number of relations of arity at most d and since Z{J^',R) G 0{k'^), this limits the number of 
such variables by 0{k'^). For all other variables, their non-zero-closed occurrences must be in 
implications, since negative clauses are zero-closed on all positions. Thus, these variables must 
be implied by a variable of X. Since each variable implies at most k — 1 other variables, we 
get an overall bound of 0{k'^^^). Finally, the total size of J- is polynomial for a fix d, since the 
number of variables is polynomial and the arity of the constraints is bounded. 

To express the 0-constant, we add k + 1 new variables zi, . . . , Zk+i- Every constraint with 
at least one is replaced by A; -|- 1 copies, each time replacing with a different Zj. Clearly one 
of the Zi takes value in any assignment of weight at most k. Hence the original constraints 
with constant are enforced. Conversely, given a satisfying assignment of weight at most k for 
the formula before making this replacement, we can easily extend it by assigning to each Zj. 
This construction does not affect our upper bound on the number of variables. □ 
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5 Kernel Lower Bounds 



We will now complete the dichotomy by showing that if Min Ones SAT(r) is A/''P-complete and 
some G r is not mergeable, then the problem admits no polynomial kernelization unless NV 
C co-A/''P/poly. The central concept of our lower bound construction is the following definition. 

Definition 5. A log-cost selection formula of arity n is a formula on variable sets X and Y , 
with |y| = n and \X\ = 'nP^'^\ such that there is no solution where Y = but for any yi ^Y 
there is a solution where yi = 1, yj = for j ^ i, and where a fix number Wn = O(logn) 
variables among X are true. Furthermore, there is no solution where fewer than Wn variables 
among X are true. 

We will show that any T as described can be used to construct log-cost selection formulas, 
and then derive a lower bound from this. The next lemma describes our constructions. 

Lemma 5. The following types of relations can implement log-cost selection formulas of any 
arity. 

1. A 3-ary relation R3 such that {(0,0,0), (1,1,0), (1,0, 1)} C ijg and (1,0,0) ^ R3, together 
with relations {x = 1) and (x = 0). 

2. A 5- ary relation such that {(1, 0, 1, 1, 0), (1, 0, 0, 0, 0), (0, 1, 1, 0, 1), (0, 1, 0, 0, 0)} C i^g 
and (1, 0, 1, 0, 0), (0, 1, 1, 0, 0) ^ R^, together with relations [x 7^ y), (x = 1), and (x = 0). 

Proof. Let Y = {yi, ...,?/„} be the variables over which a log-cost selection formula is requested. 
We will create "branching trees" over variables for < i < log2?^, 1 < J < 2% as variants 
of the composition trees used in [16]. Assume that n = 2^ for some integer h; otherwise pad Y 
with variables forced to be false, as assumed to be possible in both constructions. 

The first construction is immediate. Create the variables Xij and add a constraint (xo,i = 1). 
Further, for all with < i < /i and 1 < j < 2*, add a constraint i?3(xjj, Xj+i^2j-ii a;j+i^2j)- 
Finally, replace variables Xh,j by yj. By the requirements on R3, for every internal variable Xjj, 
if Xi^j = 1 then one of its children Xi+i.2j-i and Xi+i^2j must be true. Thus by transitivity, 
some variable on each level of the branching tree must be true, making y = is impossible. 
Conversely, for any variable yi on the leaf level, there is a solution where exactly the variables 
along the path from the root node to yi are true. Thus Wn = h = log2 n. 

The second construction uses the same principle, but the construction is somewhat more 
involved. Create variables Xij and a constraint (xo,i = 1) as before. In addition, introduce 
for every < i < h — 1 two variables li, ri and a constraint (Zj 7^ ^j). Now the intention is 
that {li,ri) decides whether the path of true variables from the root to a leaf should take a left 
or a right turn after level i. Concretely, add for every i,j with < i < h — 1 and 1 < j < 2' 
a constraint i?5(/j, rj, Xjj , Xi+i^2j-i) a^i+i,2j)- Now for every true variable Xij, it is not allowed 
that Xi+i^2j~i = Xi+i^2j = 0, while depending on li and rj, either (xi+i^2j-i = ^,Xi+i^2j = 0) 
or (xj+i^2j-i = 0,Xj4.i^2j = 1) is allowed. This rules out the case Y = 0, while for each set of 
values of li, ri it is allowed to set among variables Xij exactly the variables along a path from 
the root to a leaf yi to true, and other variables to false. In total, exactly two variables not 
among Y are true per level in such an assignment, making Wn = 2h = 21og2 n. □ 

We now reach the technical part, where we show that any relation which is not mergeable 
can be used to construct a relation as in Lemma El The constructions are based on the concept 
of a witness that some relation R lacks a certain closure property. For instance, if R is not 
mergeable, then there are four tuples a, (3,^,5 G R to which the merge operation applies, but 



10 



such that a A (/3 V 7) ^ R] these four tuples form a witness that R is not mergeable. Using the 
knowledge that such witnesses exist, we use the approach of Schaefer jl9j . identifying variables 
according to their occurrence in the tuples of the witness, to build relations with the properties 
we need. 

Lemma 6. Let T he a set of relations such that Min Ones SATfT) is MV -complete and 
some R T is not mergeable. Under a constraint that at most k variables are true, T can 
be used to force {x = 0) and {x = 1). Furthermore, there is an implementation of (x = y) 
using R, (x = 0), and (x = 1). 

Proof. First of all, we show how to force (x = 1). Since Min Ones SAT(r) is A/'T'-complete, it 
contains some relation that is not zero-valid; let i? € T be such a relation. If R is one-valid, 
then i?(x, . . . , x) is equivalent to (x = 1). Else, let r be the arity of R and let / be a maximal 
set such that R{xi, . . . ,Xr) holds for Xj = 1 for z G /, Xi = else. Identify all Xj, i G /, to a 
single variable x, and all Xj, i ^ /, to a single variable y. This forms a new constraint R'{x,y), 
where (1,0) € R'{x,y) and (0,0), (1, 1) ^ R'{x,y). Thus R' is either (x = 1 A y = 0) or (x / y). 
In the former case we are done; in the latter case, constraints x ^ yi for 1 < i < A; + 1 force x = 1 
and all = in any solution with at most k true variables. 

Now we can use this to force (x = 0) and (x = y). Let a through 5 be a witness that R 
is not mergeable; let cr = a A (/3 V 7) ^ R he the produced tuple. Notice that (3 < a < a, 
meaning that the positions of R are of four types: those where P < a, those where a < a, and 
optionally positions which are constant among these tuples, i.e. true in f3 or false in a. Call 
these positions Cx, Cy, Ci, and Co, in the order they were introduced. Place a variable zi = 1 
in all positions Ci, if any, and variables x and y in all positions Cx resp. Cy. Now, if there are no 
positions Co, then this creates a constraint R'{x, y) such that i?'(x, y)AR'{y, x) implements (x = 
y) directly. This can be used to force x = 0: create k variables yi and let x = yi for every i. In 
any solution with at most k true variables, all these variables are false. 

Otherwise, if there are positions Co, then place the variable y in these positions as well, and 
apply R'{x,y) A R'{y,x) again; the result is either (x = y) or (x = y = 0). Finally, placing a 
variable zq = in positions Cq lets us implement (x = y) as above. □ 

Lemma 7. Let Min Ones SATfT) be NP-complete, and not mergeable. Then Min Ones SATfT) 
can express a log-cost selection formula of any arity. 

Proof. Let i? € F be a relation that is not mergeable, and let a through Shea witness of this. By 
Prop.[Tl partition the positions of R into core and petals in a way that agrees with the witness. 
Group the variables w.r.t. their values in these four tuples into constant variables Zi, Zq, non- 
constant core variables Cio and Coi, and non-constant petal variables Pu, Piq, Pqi (where the 
indices indicate membership in a and 7, as /? and 6 are now determined by this). Identify 
variables according to type, and order them in the order of the previous sentence. We now 
have a relation whose arity depends on which variable types that are represented in the witness. 
In the case that all seven types are present, we have implemented a 7-ary relation Rj about 
which we know the following (the final tuple is produced on the witness tuples by the merge 
operation). 



1,0,1,0,1,1,0) 


G 


R7 


1,0,1,0,0,0,0) 


G 


R7 


1,0,0,1,1,0,1) 


G 


R7 


1,0,0,1,0,0,0) 


G 


Rr 


1,0,1,0,1,0,0) 




R7 
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To distinguish the final tuple from the witness tuples, we can observe that variable types Pu, Piq, 
and one further non-constant variable type must be represented by the witness. The constant 
positions can be ignored by putting variables zi = 1 and = in these positions, by Lemma El 
Thus we implement a relation of arity between three and five. 

First assume that R is dual Horn. Then the tuple /3 V 7 € ii, i.e. (1, 0, 1, 1, 1, 0, 1) G Ry, and 
the variable type Cqi or Pqi must occur. Identify Cqi and Pqi if both occur, and set Cio = 1 
if this type occurs, implementing a 3-ary relation R' which matches R3 of Lemma O with the 
variable types being Pu, Pio, and (Pqi = Cqi) in the order used in Lemma [5l Indeed, {a, (3 V 
7,(9} C P, representing the positive requirement, while there can be no tuple (1,0,0) € R', 
whether Cio occurs or not. This and Lemma [6] fulfills the conditions of Lemma [Sj part 1. 

Otherwise R is not dual Horn, in which case it is not closed under disjunction. Using a 
witness for this, we can implement a 2-ary relation R2 which is either (x 7^ y) or (-ix V -ly). 
Likewise, by AAT'-completeness we have a relation which is not Horn, which can implement {x ^ 
y) or (x V y). Combining them, we find that we can always implement (x 7^ y), and thus are 
free to use P5 of Lemma \5\ We implement a relation R' as before, again letting the variable 
types appear in the order (Cio, Cqi, Pu, Pio, Poi)- We go through the cases of non-empty non- 
constant variable types, and show that our relation R' can implement a relation matching P3 
or P5 of Lemma [5l 

1. If R' has arity three, with the third variable type being Pqi or Cqi, then we implement a 
relation matching P3 with {(1, 1, 0), (1, 0, 1), (0, 0, 0)} C R' and (1, 0, 0) ^ R'. 

2. If R' has arity three, and the third type is Cio, then we implement a relation R' with 
{(1, 1, 1), (1, 0, 0), (0, 1, 0), (0, 0, 0) C R' and (1, 1, 0) ^ P'. Use R'{v, x, y) A R'{w, x, z) to 
implement a relation matching P5. 

3. If the core type Ciq is not present, then identify Cqi with Pqi. This implements a rela- 
tion R' matching P3. 

4. If the core type Cqi is not present, we need two cases. If (0, 1, 0, 0) ^ P', then identify Cio 
with Pio to produce a 3-ary relation matching P3. Otherwise, force Pqi = to produce a 
3-ary relation as in case [21 

5. If the petal type Poi is not present, then R'{v, w, x, y)AR'{w, v, x, z) implements a relation 
matching P5. 

6. If all five types are present, then P'(f , x, y, z) A R'{'w, v, x, z, y) implements a relation 
matching P5. 

Thus in every case, we meet the conditions of part 1 or 2 of Lemma O D 

We now show our result, using the tools of [6j. We have the following definition. Let Q 
and Q' be parameterized problems. A polynomial time and parameter transformation from Q 
to Q' is a polynomial-time mapping : S* x N ^ S* x N : (x. A;) 1-^ (x', k') such that 

V(x, A;) G S* X N : ((x, k) £ Q ^ (x', k') G Q') and k' < p{k), 

for some polynomial p. 

We will provide a polynomial time and parameter transformation to Min Ones SAT(r) from 
Exact Hitting Set(m), defined as follows. 

Input: A hypergraph 7i consisting of m subsets of a universe U of size n. 
Parameter: m. 

Task: Decide whether there is a set S (Z U such that |P fl S"! = 1 for every E G TC. 
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It was shown in [6J that polynomial time and parameter transformations preserve polynomial 
kernelizability; thus our lower bound will follow. To establish a lower bound for Exact Hitting 
Set(m), we need the following notions from E]. Let Q be a parameterized problem. A 
composition algorithm for Q is an algorithm that on input (xi,k),... ,{xt,k) C S* x N uses 
time polynomial in Yll=i l^^il + ^ ^^'^ outputs {y, k') with k' bounded by a polynomial in k and 
such that (y, k') € Q if and only if (xj, A;) S Q for at least one i G {1, . . . , t}. The problem Q is 
then said to be compositional. 

The derived classical problem Q of Q is defined by Q = {xjj^l^ \ (x, k) € Q}, where # ^ S 
is the blank letter and 1 is any letter from E. 

Following Dom et al. j9], we next give our equivalence of a colored version of the problem, 
which we call Exact CSP(m + n), defined as follows. 

Input: A CSP instance with n variables of arbitrary finite domain, and m con- 
straints Exactly-One(?;j^ = hi^,...,Vi^ = hi^) of arbitrary arity, where each Vi is a 
variable and bi a value from the respective variable domain. 
Parameter: m + n. 

Task: Decide whether there is an assignment of a value to every variable that 
satisfies each constraint (i.e. for each constraint, exactly one statement v = b \s 
true). 

We show that Exact CSP(m + n) admits no polynomial kernelization; the result follows by a 
trivial problem reduction from Exact CSP(m + n) to Exact Hitting Set(m). The proof follows 
the same lines as the lower bound for Unique Coverage in [9l Sec. 4.2], but the construction 
is somewhat simplified, and the lower bound somewhat stronger (as Exact Hitting Set(m) is 
equivalent to a special case of Unique Coverage) 

Lemma 8. Exact Hitting Set{m) admits no polynomial kernelization unless MV C co-MV /poly. 

Proof. First, Exact CSP(m+n) is AAP-complete (even if all domains have cardinality 2, in which 
case it is the Exact Satisfiability problem). Also, the problem can be solved in time 0*{n'^). 
Decide for each constraint the identity of the variable which will hit it (but not yet its value). 
Assuming that a variable v is chosen for a particular constraint, for every statement {vi = bj) in 
the constraint with v ^ Vi, remove the value j from the domain of the variable Vi, and restrict 
the domain of v to those values which would hit the constraint. Repeat for all constraints, 
backtracking if necessary; the size of the search tree is at most n™. Thus, we may assume in 
our composition algorithm that the number of input instances is bounded by n"* (or else we 
solve all instances in time polynomial in the total input size). 

Assume, then, that there are t input instances. Let n be the maximum number of variables 
and m the maximum number of constraints; for simplicity of the argument, assume that all 
input instances have the same numbers of variables and constraints (or else do trivial padding 
with unary-domain variables or trivially true constraints, such as "variable 1 has exactly one 
value"). Number the variables from 1 to n and the constraints from 1 to m in each input 
instance. 

Now create the composed instance. First collect all the values of variables numbered i into 
the domain of a single variable v'^, say with values (ji, ^'2) signifying "value j2 in the domain of 
input instance ji". Similarly concatenate all constraints numbered i into a single constraint, 
over these new domain values. Note that values stemming from different instances are different, 
so that a constraint is hit only once in an intended solution (where all values come from the same 

^Dom et al. also give a proof for the problem Bipartite Perfect Code, which is the same underlying 
problem as here, but the parameterization is different. 
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instance). We finally need to add constraints to ensure that all variables take values stemming 
from the same input instance. 

For this, assign to each input instance a number from 1 to t as its ID, and write this in 
binary form. Let I = [log2t] = 0(m log n) be the number of bit levels needed. For each pair of 
values + 1), 1 < i < n, and each bit level j, I < j < I, add a testing constraint consisting 
of all values of Vi for which the j:th digit of the ID of the originating instance is 1, and all 
values of Uj+i for which the j:th digit of the ID is 0. This makes 0(mn log n) extra edges. If 
variables Vi and vj take values from different input instances, then their IDs will differ in some 
position, which will lead to one of these testing edges being hit twice or not at all. Otherwise, 
each testing edge is hit exactly once. By transitivity, this forces all variables in the composed 
instance to take values from the same input instance. This completes the compositionality 
proof, showing that Exact CSP with parameter m + n admits no polynomial kernelization. 

The result for Exact Hitting Set with parameter m follows by a simple reduction. Let the 
vertices of the hitting set be the individual variable values, create for each variable an edge 
containing all its values, and retain all constraints as edges. We get an equivalent instance 
with m + n edges. □ 

We can now show the main result of this section. 

Theorem 4. Let T be a constraint language which is not mergeable. Then Min Ones SATfT) 
is either polynomial-time solvable, or does not admit a polynomial kernelization unless AfV C 
co-MV /poly. 

Proof. By Theorem [H Min Ones SAT(r) is either polynomial-time solvable or TVP-complete; 
assume that it is A/'P-complete. By Lemma[6]we have both constants and the constraint (x = y), 
and by Lemma [7] we can implement log-cost selection formulas. It remains only to describe the 
polynomial time and parameter transformation from Exact Hitting Set(rn-) to Min Ones SAT(r). 

Let be a hypergraph. If TC contains more than 2™ vertices, then it can be solved in 
time polynomial in the input length [3]; otherwise, we create a formula J-" and fix a weight k 
so that {J-, k) is positive if and only if Ti. has an exact hitting set. Create one variable j- 
in for every occurrence of a vertex Vi in an edge Ej in 7i. For each edge -E € 7Y, create 
a selection formula over the variables representing the occurrences in E. Finally, for all pairs 
of occurrences of each vertex Vi, add constraints {yij = yi,j'), and fix A; = m + YliEeH'^\E\^ 
where Wi is the weight of an i-selection formula. We have an upper bound on the value of k 
of 0(m- log n) = 0{m?). 

Now solutions with weight exactly k correspond to exact hitting sets of Ti. Note that k is 
the minimum possible weight of the selection formulas, which is taken if exactly one occurrence 
in each edge is picked. By the definition of log-cost selection formulas, any solution where more 
than one occurrence has been picked (if such a solution is possible at all) will have a total weight 
which is larger than this, if the weight of the y-variables is counted as well, and thus such a 
solution to T of weight at most k is not possible. 

As Exact Hitting Set(m) is A/''P-complete, it follows from [6] that a polynomial kernelization 
for Min Ones SAT(r) would imply the same for Exact Hitting Set(m), giving our result. □ 

Finally, let us remark that Lemma[6]can be adjusted to provide {x = 1) and (x = y) without 
the use of repeated variables, and that using standard techniques (see [TBJ and Theorem [3]), we 
can show that the lower bound still applies under the restriction that constraints contain no 
repeated variables. Such a restriction can be useful in showing hardness of other problems, e.g., 
as in p]. 
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6 Conclusions 



We presented a dichotomy for Min Ones SAT(r) for finite sets of relations T, assuming that the 
polynomial hierarchy does not collapse. The characterization of the dichotomy is a new concept 
we call mergeability. We showed that Min Ones SAT(r) admits a polynomial kernelization if the 
problem is in V or if every relation in T is mergeable, while in every other case no polynomial 
kernelization is possible unless AfV C co-A/'T'/poly, in which case the polynomial hierarchy 
would collapse to the third level. 

It might be interesting to compare our kernelization dichotomy to the approximation prop- 
erties of Min Ones SAT(r), as characterized by Khanna et al. [H]. The mergeability property 
cuts through the classification of Khanna et al. as follows (we use the terms from B.4 of [14]). 
For every T such that Min Ones SAT(r) is known to be in APX, Min Ones SAT(r) admits 
a polynomial kernelization, while no problem identified as being Min Horn Deletion-complete 
IS merg eablel The remaining classes are cut through (e.g., among the affine relations, the 
relation (x + y + z = 1 (mod 2)) is mergeable, while {x + y + z = (mod 2)) is not). We also 
get kernelizations for some problems where the corresponding SAT problem is TVP-complete, 
e.g., Min Ones Exact Hitting Set for sets of bounded arity, where no approximation is possible 
unless V =MV. 
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